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A statistical nodel for analysis of multiple 
time-S€»ries observation is briefly outlined. The model incorporates a 
change parameter corresponding to intervention or interruption of the 
dependent series. The additioni^l time**series are included in the 
model as covariates. The practj,cal application of the procedure is 
illustrated with traffic fatality data from several Northeast states. 
The intervention variable is a legislative crackdovn on speeding 
offenders in Connecticut, vhich vas claimed to have produced a 
significant drop in the traffxc fatality rate. The analysis using 
neighboring states as covariates results in a change to 
nonsignif icance for the intervention effect. The seasonal nature of 
the data also illustrates the applicability of the procedure vhen 
seasonality is present, so that no season adjustments are required. 
(Author) 
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The quasi-experimental time-series design noted by Campbell and 
Stanley (1963) has been examined by a series of authors in the last few 
years. Box and Tiao (1965) derived a statistical method for estimating 
intervention effects in time-series using a discrete stochastic model, the 
integrated-moving averages (IMA) process.^ Campbell and Ross (1968) dis- 
cussed suoh an analysis applied to time-series of traffic fatalities in 
Connecticut between 1951 and 1960. Glass, Willson, and Gottman (1972) 
considered the methodology to date and produced an initial attempt to 
organize the extant work on the time-series experiment problem. The 
present paper reports some additional theoretical work and provides a 
methodological example in which some time -series variables act as covariates 
on a dependent variable. 

Box and Tiao (1965) utilized the simplest non-stating stochastic IMA 
process, IMA (1,1): 



\ " ^-1 = "^Vi -^^^ 



(1) 



where z^ is an observation of the dependent variable at time t, a^ is an 
unobserved random variable uncorrelated over time, and 0_ is a coefficient of 
autocorrelation which takes values between -1 and 1. Nonzero values of 0 



^Paper presented at the Annual Meeting of the American Educational 
Research Association, Chicago, Illinois, April 19, 1974. 



'See Box and Jenkins (1970). 
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produce the characteristic autocorrelation of z found in time-series. If 
observations of were not correlated, ordinary parametric statistics could 
be used to evaluate interventions In a time-series. Scheffe (1959) and 
others have pointed out the serious effects of autocorrelation upon sampling 
distributions of t-statistlcs, making usual normal parametric tests unusable* 
Box and Tiao (1965) derivec^ a method for transforming away the autocorrela- 
tion. They noted that the model in (1), may be rev/ritten as 

t-1 

2. = L + (1 - 9) Z a + a , (2) 
^ 1=1 ^ ^ 

where L is a location parameter for the first observation (time-series of 

this sort do not have expected values) • The variable was shown to be 

transformable to ^9 

which is In the form of the general linear model (let -0 = 1 - X) , 

» (1 - + a^, (4) 

or 

Yj. = X6 + a. (5) 

With suitable dlstrlbut'.onal assumptions about £ (normal. Independent and 
Identically distributed with finite variance), the ^ Is estimable in terms 
of a design matrix X and observation Y under least-squares estimation. If 
the intervention of the time-series experiment, 6, is Included with equation 
(2) after n^ observations, 
t-1 

2. •= L + X Z a. + 5 + a , (6) 
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Then equation (4) becomes 

t-1 t-n,-l . 

» (1 - X) L + (1 - X) 6 + a^, (7) 

and both L and S may be estimated by method of least-squares. The value for 
X has not been specified. Box and Tlao (1965) suggested that an Iterative 
method be used (calculate estimates for valae of X between :he theoretical 
limits 0 and 2), using a minimum variance criteria or a Bayeslan maximum 
likelihood function on the residual variance. 

Campbell and Ross (1968) discussed the time-series analysis of data on 
traffic fatalities In Connecticut between 1951 and 1960. An Intervention 
effect was hypothesized to have occurred on January 1, 1956, based upon a 
"crackdown" on speeding offenders ordered by Governor Abraham Rlblcoff of 
the State of Connecticut on December 23, 1955. The Governor's action required 
suspension of driving licenses for persons convicted of speeding offenses. 
First offenders lost their licenses for thirty days, second offenders lost 
theirs for sixty days, and third offenders had their licenses suspended 
Indefinitely. Inspection of the traffic fatality rate per 100,000,000 
driver miles shows a large drop In the rate between December, 1955, and 
January, 1956 figures. This was cited by the Connecticut State Administra- 
tion as evidence of the effectiveness of the "crackdown" action. 

Analysis of the Connecticut data In which seasonal variations were 
removed showed a significant difference from zero for the hypothesized inter- 
vention. Similar data for nearby states — Massachusetts, Rhode Island, New 
York, and New Jersey — were examined, and similar drops in level were noted 
in Massachusetts, New York, and New Jersey. This evidence was presented as 
disconflrmation of the hypothesis that the crackdown produced a significant 
drop in the level of Connecticut fatalities per 100,000,000 driver miles (see 
O Glass, 1968). 
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Glass, Willson, ard Gottman (1972) noted that the analysis of multiple 
time-series may be considered a multiple-group, multiple-intervention design. 
A multiple comparisons analvsis of the estimated intervention effects observed 
in the five states was presented. A planned orthogonal comparison between 
the Connecticut change in level and the mean of the changes in level for the 
other four states results in a t-statistic whose probability of occurrence 
by chance is .074. The data were seasonally adjusted by subtracting the 
monthly average for ten years from each month's value. 

Several difficulties appear in this analysis. First, it is not clear 
that the same problem Is being addressed when seasonal adjustments are made. 
Although traditional in economics, the series observed after adjustment are 
not the same as those observed before in terms of X (see Willson, 1972) . 
Also, the common design question arises: does one assign a variable as a 
factor in the design or covary on the variable? Initial differences on the 
variable are cited as a reason for ANCOVA. Willson (1973) derived a statis- 
tical treatment for covariation in time-series. The basic assumption is that 
the random variable a^ is not truly random but is correlated with the covariate 
c^* This may be checked statistically. If significant correlation is observed, 
a^ is decomposed into two components, 

a^ = ^(c^ - c) + e^, (8) 

when c" is the observed mean of £ prior to intervention, and in which e^ is now 

identically, independently distributed (normality is assumed for estimation 

of confidence intervals). The result is that instead of equation (7), 

t-1 t-n- -1 

y^ = (1 - X) L (1 - X) ^ 6 + *(c^ " <r) + e^, (9) 

so that now L, 6, and ({> are estimated. Multiple covariates may be added much 
as in multiple ANCOVA. With this procedure, the Campbell and Ross data were 
reanalyzed as an ANCOVA problem. 



The Rhode Island data are Ignored since a good case may be made for 

noncomparablllty of population and miles driven with respect to the other 

states 9 resulting in unstable estimates for the states. The original data 

are examined without removal of the seasonal variation. The data for three 

3 

states are graphed in Figure 1« The seasonal components are hypothesized 
not to affect estimations since regression coefficients and the least-squares 
procedures are based on deviations from the mean level, which will be matched 
for different variables on the seasonal component. 

The intercorrelation matrix of lag zero first differences is presented 
in Table 2 for the raw data on fatality rates per 100,000,000 driver miles 
for the four states under consideration* Zero lag is chosen since it may 
be assumed that the causes of traffic fatalities operate simultaneously for 
all the states in a similar fashion* 



TABLE 1: Intercorrelation Matrix of Fatalities Per 100,000,000 
Driver Miles for Four States for Lag Zero First Difference Data* 





Connecticut 


Massachusetts 


New York 


New Jersey 


Connecticut 


1.00 


.22 


.18 


.38 


Massachusetts 


,22 


1.00 


.15 


.22 


New York 


.18 


.15 


1.00 


!56 


New Jersey 


.38 


.22 


.56 


1.00 



New York was ignored; explanation is given In the next paragraph. 

Variables may be regressed on time - lagged data* Here the correlations 
showed no significant lagged correlations so that regression was contempora- 
neous in time* First differences (z^ - ^t-1^ calculated because the 
model is an IMA (1,1) (Campbell and Ross, 1968)* 
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TABLE 2 : Estimates of Parameters for Analysis of Connecticut Traffic 
Fatalities per 100, 000, 000 Driver Miles With and Without 
• Massachusetts and New Jersey Traffic Fatalities per 
100,000,000 Driver Miles as Covariates. 



Pre-Intervention Data: 60 Observations by Month (January, 1951- 

December, 1955) . 
Post-Intervention Data: 48 Observations by Month (January, 1956- 
December. 1960), 



With Covariate 



Without Covariate 



i = 3.39 
SE(£) = .11 
6 = -.22 
SE(5) = .17 

= .522 



. 2 
Conn 



=.790 

Mass 

=. 399* 

Mass 
SE(*^_j_) = .106 



£ = 3.57 



SE(L) = .12 

8 = -.61* 

SE(6) = .18 

?^ =.831 
Conn 



* 

(p < .05 for significance from zero). 
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The variables associated with New Jersey and Massachusetts are most 
highly correlated vlth the Connecticut variable, and the two are chosen for 
the analysis as covarlates,^ The estimates of parameters with and without 
covarlates are given In Table 2* 

Comparison of the mean levels with and without covarlates shows that 
the two estimates are similar In value. The change In level due to the 
Rlblcoff Intervention Is significantly different from zero In the Connecticut 
data without covarlates, but the Intervention Is reduced to a nonslgnlf Icance 
when covarlates are Included > The regression coefficients for New Jersey 
and Massachusetts fatality rates are both significantly different from zero 
(p < .05) when the covarlates are Included In the analysis. 

The main point Is that the Intervention Is reduced to nonslgnlf Icance 
with the Inclusion of the covarlates. This Is the same conclusion reached 
by Campbell and Ross (1968), and by Glass (1968), but the conclusion Is 
based upop simultaneous mathematical analysis of the four-variable system 
(lndepende.it, dependent, and two covarlates) rather than upon separate 
analyses. This procedure seems preferable to au approach like multiple 
comparisons analysis when the units of observation cannot be selected 
randomly. This parallels the desirability of using analysis of covarlance 
over multlfacet analysis of variance when randomization Is not. possible and 
groups may be Initially different. Also, no manipulation of the series In 
terms of seasonal adjustment Is necessary. 



As In ANCOVA, one desires covarlates highly correlated with the 
dependent variable and not with each other. 
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